A QlPfiP ? a 09/ 869926 

WO 00/41128 ^ l\Y' I _ ^ PCT/ILOO/00010 

JCt8RecmT/PT0 0 9 JUL 2001 

\ OPTICAL SCANNER AND SOFTWARE THEREFOR 



FIELD OF THE INVENTION 

The present invention is in the field of text scanners and software for use 
therewith. 

BACKGROUND OF THE INVENTION 

5 Hand-held optical scanners are used in a variety of applications including 

text scanning, barcode reading, and picture grabbing. These scanners comprise an 
optical sensor for acquiring image information that is managed by a central 
processing unit (CPU) and stored in a memory. The information is typically 
processed by character recognition software and then sent to a user application such 

10 as a word processor, database or spreadsheet 

The viewing area of hand-held scanners at a given instant includes only a 
small portion of the entire scanned medium. Thus in use, a hand-held scanner is 
made to pass over the entire scanned medium so as to produce a time series of 
acquired images where each acquired image corresponds to a small portion of the 

is scanned medium. Since the images are acquired at a constant rate, subsequent 
reconstruction of the scanned medium from the series of acquired images produces 
a distorted image due to variation in the scanning speed during scanning. A hand 
held scanner has a reference scanning speed at which the acquired image is 
identical to the original. The reference scanning speed is the product of the width 

20 (in the scanning direction) of the optical sensor and the rate at which images are 
acquired by the sensor. Locations in the scanned medium acquired when the 
scanning speed is below the reference speed appear to be stretched cr expanded, 
while locations acquired when the scanning speed is above the reference sreec 
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appear compressed. Such distorted images may not be recognizable by character 
recognition software. Each acquired image must thus be processed by rescaling it 
in the scanning direction by a correction factor equal to the instantaneous scanning 
speed divided by the reference speed. Prior art scanners therefore comprise means 

5 for continuously monitoring the instantaneous scanning speed. U.S. Patent 
5,083,2 18, for example, discloses a hand-held scanner comprising a wheel that rolls 
along the scanned medium during scanning for measuring the instantaneous speed 
at every moment during the scanning. U.S. Patent 5,023,922 discloses a hand-held 
scanner with a 2-dimensional optical sensor in which the instantaneous speed is 

10 calculated from the time interval required for an image to traverse the sensor. The 
presence of a wheel or a two dimensional sensor makes the hand held scanner 
bulky and awkward to use. 

There is therefore a need in the art to provide a scanner that substantially 
reduces or overcomes the disadvantages of prior art scanners. 
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GLOSSARY 

There follows a glossary of terms used in the following description and set 
of claims together with their definitions, some of them known in the art, others 
having been coined. 

Field of view - the portion of the scanned medium acquired by a scanner at one 
instant 



Scanned image - a two-dimensional image created by moving a scanner over a 
25 surface. The scanned image is compiled from the time series of fields of view 
acquired during scanning. 



Text image — A scanned image composed of text in any language. 
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Cluster -^A contiguous group of black pixels in a text image. A black cluster is 
typically a single character but may be a portion of a single character, or a union of 
two or more characters. 

Horizontal segment -A line of contiguous black pixels parallel to the text line, one 
pixel high. The length of a horizontal segment is the number of its black pixels. 

Stack - A contiguous array of at least a predetermined number of horizontal 
segments in a scanned image with the following properties: 

(a) No two of the horizontal segments are collinear. 

(b) The ratio of the length of the longest horizontal segment in the stack 
to that of the shortest horizontal segment does not exceed a 
predetermined value. 

(c) adjacent horizontal segments in the stack at least partially overlap (i. 
e. in a given pair of adjacent horizontal segments in the stack, there is 
at least one black pixel in each of the two horizontal segments 
acquired from the same field of view). 

Each stack has associated with it a height that may be defined, for example, as the 
number of its composite horizontal segments. Each stack also has associated with 
it a width that may be defined, for example, as the average length of its horizontal 
composite segments, the minim al length of its horizontal composite segments, or 
the length of an arbitrary one of its horizontal segments. The invention is not to be 
considered as being bound by these definitions, and other definitions of height and 
width are contemplated within the scope of the invention. Each stack also has 
associated with it a bounding rectangle of minimum dimensions in which two 
opposite sides are vertical and two opposite sides are horizontal. 

Essentially vertical stack - A stack whose height exceeds a predetermined number 
in which the ratio of the width of the stack to the width of its bounding rectangle 
exceeds a predetermined value. 
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Character recognition software - Any software package for converting a text 
image into a string of ASCII characters, for example Optical Character Recognition 
(OCR) software. 

Stop interval - A series of consecutive fields of view, the number of which exceed 
a predetermined number, all of which are nearly identical to the first field of view. 
Two fields of view are considered to be nearly identical if there does not exist in 
one of the fields of view a subfield of contiguous pixels of length greater than a 
predetermined length in which all of the pixels are different from the corresponding 
pixels in the other field of view. 

Font ratio - the ratio of the mode character height (the most common character 
height) to the most common essentially vertical line width of the characters in a 
particular font. The font ratio is a characteristic constant of a given font For most 
fonts, the font ratio is typically, although not necessarily, around 6. 

Correction factor- the instantaneous scanning speed divided by the reference 
speed. 

SUMMARY OF THE INVENTION 

In the following description and set of claims, the direction of the text line 
as well as the scanning direction of a hand held scanner is arbitrarily designated 
as the horizontal direction. This designation is being' made only for the sake of 
simplicity in describing the invention. Other scanning directions are also 
contemplated within the scope of the invention and the invention is not to be 
considered as being bound by this arbitrary designation. 

In its first aspect, the present invention provides a scanner having a 
one-dimensional (linear) optical sensor that creates an image one pixel wide and 
n pixels in length (height), and not having mechanical moving parts such as a 
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wheel. Aperies of scanned images is acquired at a constant rate determined by an 
internal clock inside the scanner when the scanner's sensor array is moved over a 
surface so as to scan a line of symbols. Image information obtained by the sensor 
may be stored and processed in a computer processing unit. 
5 In its second aspect the present invention provides a method for detennining 

the instantaneous correction factor of a text image during scanning by a hand held 
scanner. Since the instantaneous correction factor is equal to the instantaneous 
scanning speed divided by the reference speed of the scanner, the method is 
equivalent to a method for detennining the instantaneous scanning speed. The 

10 method may be used with a scanner not having any mechanical moving parts. It 
may also be used with scanners having only a one-dimensional optical sensor. The 
method of the invention may be used for removing distortions in a text image due 
to variation in the scanning speed in order to make the text recognizable by optical 
character recognition software or readable as an image by a user 

15 The method of the invention is based upon the observation that in a 

scanned image, vertical distances (i.e. distances perpendicular to the scanning 
direction) at any particular location are independent of the instantaneous scanning 
speed at that location. In particular, in a text image, the height of a text character 
is independent of the scanning speed. The mode character height in a text image 

20 is therefore independent of any variability in the scanning speed during 
acquisition of the text image. As disclosed below, the instantaneous scanning 
velocity, or equivalently the instantaneous correction factor, at a location in a text 
image comprising an essentially vertical stack can be determined from the width 

* 

of the essentially vertical stack, the mode character height of the scanned text, 
25 and the font ratio. 

The invention thus provides a scanner for scanning a surface characterized 
in that it has a one-dimensional optical sensor and has no mechanical moving 
parts for detennining the scanning speed. 
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The invention also provides a method for processing an acquired text imaae 
obtained by scanning a text with a scanner, the text having one or more fonts, 
each font having a font ratio, each location in the text having been scanned at an 
instantaneous scanning speed, where the text image is distorted due variability of 
5 the instantaneous scanning speed, the method comprising the following steps: 

(a) Providing a mode character height for each font; and 

(b) utilizing said mode character height and font ratio for constructing a 
correction factor in order to correct the distorted text image. 

The invention further provides a storage medium storing an executable 
computer program for processing an acquired text image obtained by scanning a 
text with a scanner, the text having one or more fonts, each font having a font 
ratio, each location in the text having been scanned at an instantaneous scanning 
speed, where the text image is distorted due variability of the instantaneous 
scanning speed, the processing comprising the following steps: 

(a) Providing a mode character height for each font; and 

(b) utilizing said mode character height and font ratio for constructing a 
correction factor in order to correct the distorted text image. 

BRIEF DESCRIPTION OF THE DRAWINGS 

In order to understand the invention and to see how it may be carried out in 
20 practice, a preferred embodiment will now be described, by way of non-limiting 
example only, with reference to the accompanying drawings, in which: 
Fig. 1 shows a hand-held scanner according to the invention; 
Fig. 2 shows a block diagram of the processing carried out according to the 
invention; 
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Fig?3 shows 7 characters resolved into 8 clusters; 

Fig. 4, a and b each show a cluster decomposed into horizontal segments; 
Fig. 5a shows a line of text to be scanned; 

Fig. 5b shows a text image obtained by scanning the line of text of Fig. 5a 
5 with a hand-held scanner; 

Fig. 6 shows a flow chart for determining the instantaneous scanning speed 
according to the invention; 

Fig. 7 a-c show parts of an acquired text image; and 

Fig. 8 shows the acquired text image of Fig, 7 after correction in accordance 
10 with the invention. 

DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT 

In its first aspect, the present invention provides a scanner having a 
one-dimensional optical sensor and not having mechanical moving parts such as a 
wheel. As shown in Fig; 1, the scanner, generally designated as 1, has a 

15 one-dimensional optical sensor 2, a handle 3 for grasping, and no mechanical 
moving parts. A series of scanned images is acquired by moving the sensor 2 
over a surface. Fig. 2 is a block diagram of a system comprising a 
one-dimensional scanner and a processor according to the invention. The sensor 
2 is used to scan a line of symbols. The sensor is linked to the input port of 

20 central processing unit 12. The input function is managed by a CPU 14. Image 
information 16 obtained by sensor 10 may be stored by the CPU in a memory 18. 
The structure of the system of the invention is not bound to the specific 
architecture depicted in Fig. 2. 

In its second aspect the present invention provides a method for determining 

25 the instantaneous correction factor of a text image during scanning by a hand held 
seamier. The method may be used with a scanner not having any mechanical 
moving parts. It may also be used with scanners having only a one-dimensional 
optical sensor. The method of the invention may be used for removing distortions 
due to variation in the scanning speed in the text image 1 6 acquired by the sensor 2 
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in order to make the text recognizable by character recognition software or by a 
user. The method, to be described below in detail, is carried out by an application 
20 that processes the image either in real time or off-line in respect of image a 
priori stored in memory 18. The results of the processing may optionally be 
5 processed by character recognition software before being returned to the CPU or 
sent to interface 22. 

Referring now to Fig. 3, 7 characters are shown resolved into eight 
clusters. Clusters 31, 35 and 38 each form an entire character (e, c and e, 
respectively). Cluster 32 is a union of two characters (ff). Clusters 33, 34, 36 

io and 37 each contain a part of a character (i). 

Fig. 4 shows two clusters divided into composite pixels. In Fig. 4a, 
horizontal line 49, for example, has a length of 6 (pixels), and horizontal line 40 
has a length of 8. Stack 42 has a height of 12. If horizontal line 40 were to be 
added to stack 42, the resulting composite structure would not be a stack because 

15 the ratio of the length of the longest horizontal segment (8) to that of the shortest 
horizontal segment (5) in the structure would exceed a typical predetermined 
value of 1.3. Two of the twelve horizontal lines constituting stack 42 designated 
as 45a and 45b have a length of 5, while the other ten have a length of 6. The 
width of stack 42 (the average length of its horizontal lines) is thus calculated to 

20 be 5.83. In Fig. 4b stack 43 has a height of 6 and a width of 28, which is equal to 
the cluster width. Stack 43 is not considered to be an essentially vertical stack 
because its height does not exceed a typical predetermined value of about 8. 
Stack 44 is composed of 13 horizontal line segments of lengths of 5-6. In this 
stack, there is no overlap between the lowest 46 and highest 47 horizontal line 

25 segments. Stack 44 is also not an essentially vertical stack because the ratio of its 
width (6.3) to the width of its bounding rectangle 48 (12) does not exceed a 
typical predetermined value of about 0.75. The above mentioned predetermined 
values used in reference to the clusters of Fig. 4, are by way of example only, and 
the invention is not bound by these predetermined values. 
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Fig? 5a shows a line of text to be scanned. Arrows 56 indicate the mode 
character height, and arrows 57 show the most common vertical line width. In 
some fonts, essentially vertical lines are slightly slanted from a true vertical 
position. The ratio of the mode character height 56 to the most common 
5 essentially vertical line width 57 (the font ratio) is a characteristic constant of the 
font 

Fig. 5b shows a typical text image obtained by scanning the text line in 
Fig. 5a with a hand-held scanner. The text appears distorted due to variation in 
the scanning speed. Character widths are inversely proportional to the scanning 

10 speed. As can be seen by comparing the width of characters 50 and 51 in the 
original text (Fig. 5a) with their images in the text scan (Fig 5b), in interval 54 
the instantaneous scanning speed was greater than the reference speed of the 
scanner, while in interval 55 the instantaneous scanning speed was below the 
reference speed. Vertical line widths, for example that of vertical line 59, are 

15 also inversely proportional to the scanning speed. The mode character height 56, 
however, is unaffected by the variability in the scanning speed and is the same in 
Figs. 5a and 5b. Horizontal line widths, for example, that of line 58, are also 
unaffected by variations in scanning speed. Interval 52 is a typical stop segment. 
Fig. 6 shows a flow chart describing the method in accordance with one 

20 embodiment of the invention for determining the instantaneous correction factor 
at different locations in ah acquired text image such as the one shown in Fig. 7. 
The font ratio for the font of the text from which the acquired text image shown 
in Fig. 7 was obtained is, by this example, about 6. The mode height of the text 
characters is determined 60, which, as stated above, is independent of any 

25 variability in the scanning speed The mode character height of the acquired text 
shown in Fig. 7 is indicated by double arrow 70 and is equal, by this example, to 
26 pixels. A reference width is calculated 61 which is equal to the mode 
character height divided by the font ratio. For the text image of Fig. 7, the 
reference width is thus calculated to be 26/6, or 4.3 pixels. Since the font ratio is 

30 defined as the ratio of the mode character height to the most common essential]-/ 
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vertical line width of the characters comprising the font , the reference width is 
the expected width of an essentially vertical stack when the scanning speed is 
equal to the reference speed. All essentially vertical stacks in the text image are 
then identified 63. Numerals 71-91 indicate all of the 21 essentially vertical 
stacks in Fig. 7. The widths of the essentially vertical stacks are determined 64. 
Essentially vertical stacks having a width greater than the reference width are 
those where the instantaneous scanning speed was below the reference speed of 
the scanner. Essentially vertical stacks having a width less than the reference 
speed are those where the instantaneous scanning speed was above the reference 
speed of the scanner. The correction factor at an image comprising an essentially 
vertical stack acquired when the scanning speed was below the reference speed is 
then obtained 65 by dividing the reference width by the width of the essentially 
vertical stack. For example, for essentially vertical stack 86, whose width is 
greater than the reference width, the correction factor is thus calculated to be 
4.3/5.5, or 0.8. The correction factor at an image comprising an essentially 
vertical stack acquired when the scanning speed was above the reference speed is 
1. 

The invention also provides for determining the correction factor at a stop 
segment in an acquired text image. A stop segment 102 is seen in Fig. 7. In stop 
segment 102, all of the fields of view are nearly identical to the first field of view 
in the segment, which is indicative of a location in the acquired text image where 
the scanning speed was close to 0. Stop segment 102 is divided into a 
predetermined number of contiguous subsegments 66. In Fig.7, stop segment 102 
was divided into 4 subsegments. In each subsegment, at least one, but not all, of 
the consecutive fields of views is then deleted from the subsegment 67. For 
example, if all of the fields of view are deleted from each subsegment except for 
one, then stop segment 102 would appear 'as segment 802 in Fig. 8. The 
correction factor in a stop segment is a predetermined value, for example, 0.1, 
which indicates that the instantaneous scanning speed in the stop sesment was 
very slow. 
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Reverting to Fig. 6, the correction factor at a location not comprising an 
essentially vertical stack and not comprising a stop segment that has been 
processed as above, is determined by one of the following methods 68. (i) Two 
essentially vertical stacks are found flanking the location on different sides. The 
correction factor is then determined at the location between the two essentially 
vertical stacks by interpolation of the correction factors at the two flanking 
essentially vertical stacks, (u) Two stop segments are found flanking the location 
on different sides. The correction factor is then determined at the location 
between the two stop segments by interpolation of the correction factors at the 
two flanking stop segments, (iii) A stop segment and an essentially vertical stack 
are found flanking the location on different sides. The correction factor is then 
determined at the location between the stop segment and the essentially vertical 
stack by interpolation of the correction factors at the stop segment and the 
essentially vertical stack. 

Once the correction factors during -scanning of the text have been 
determined, the text image is. rectified for distortions arising from variation in the 
scanning speed 69. This is accomplished by deleting fields of view at each 
location where the correction factor is less than one so as to achieve a rescaling 
equal to the correction factor, as is known per se. When an essentially vertical 
stack is rectified in this way, it resembles the essentially vertical line in the text 
image which it represents in the scanned image. Fig. 8 shows the acquired text 
image shown in Fig. 7 after having been corrected. 

While the invention has been described with a certain degree of 
particularity, it will be immediately apparent to those versed in the art that the 
method may be varied without departing from the scope of the invention. In 
particular, the order in which the steps of the method may be carried out may be 
changed without departing from the scope of the invention as defined in the 
following claims. 



